Pegasus, a workflow management system for science automation

نویسندگان

  • Ewa Deelman
  • Karan Vahi
  • Gideon Juve
  • Mats Rynge
  • Scott Callaghan
  • Philip Maechling
  • Rajiv Mayani
  • Weiwei Chen
  • Rafael Ferreira da Silva
  • Miron Livny
  • R. Kent Wenger
چکیده

Modern science often requires the execution of large-scale, multi-stage simulation and data analysis pipelines to enable the study of complex systems. The amount of computation and data involved in these pipelines requires scalable workflow management systems that are able to reliably and efficiently coordinate and automate data movement and task execution on distributed computational resources: campus clusters, national cyberinfrastructures, and commercial and academic clouds. This paper describes the design, development and evolution of the Pegasus Workflow Management System, which maps abstract workflow descriptions onto distributed computing infrastructures. Pegasus has been used for more than twelve years by scientists in a wide variety of domains, including astronomy, seismology, bioinformatics, physics and others. This paper provides an integrated view of the Pegasus system, showing its capabilities that have been developed over time in response to application needs and to the evolution of the scientific computing platforms. The paper describes how Pegasus achieves reliable, scalable workflow execution across a wide variety of computing infrastructures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

HUBzero and Pegasus: integrating scientific workflows into science gateways

In this paper, we described the benefits and the challenges of integrating existing scientific workflow technologies into science gateways. Scientific workflow managers are powerful tools for handling large computational tasks. Domain scientists find it difficult to create new workflows, so many tasks that could benefit from workflow automation are often avoided or performed by hand. Two techno...

متن کامل

Bringing Scientific Workflow to the Masses via Pegasus and HUBzero

Scientific workflow managers are powerful tools for handling large computational tasks. Domain scientists find it difficult to create new workflows, so many tasks that could benefit from workflow automation are often avoided or done by hand. Two technologies have come together to bring the benefits of workflow to the masses. The Pegasus Workflow Management System can manage workflows comprised ...

متن کامل

A Taxonomy on Tools for Scientific Workflow Management System

Scientific workflow management systems (SWFMSs) have been shown important to scientific computing and services computing [4][5][6][7] as they provide functionalities such as work flow determination, process coordination, job scheduling and execution, provenance discover and error resistance. Systems such as Pegasus [11], Taverna [8], Swift [12] ,Vistrails [10], Kepler [9] have seen wide accepta...

متن کامل

Workflow Management in Cloud Computing

Cloud computing is a paradigm that provides demand service resources like software, hardware, platform, and infrastructure. Under cloud environment, workflow is an emerging technique for future scalable applications. This paper discusses the various tools for generating workflow and these tools have been compared on the basis of operating system, databases, architecture and so on. The applicati...

متن کامل

Pegasus and DAGMan From Concept to Execution: Mapping Scientific Workflows onto Today's Cyberinfrastructure

In this chapter we describe an end-to-end workflow management system that enables scientists to describe their large-scale analysis in abstract terms, then maps and executes the workflows in an efficient and reliable manner on distributed resources. We describe Pegasus and DAGMan and various workflow restructuring and optimizations they perform and demonstrate the scalability and reliability of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Future Generation Comp. Syst.

دوره 46  شماره 

صفحات  -

تاریخ انتشار 2015